Journal article

Boosting association rule mining in large datasets via Gibbs sampling

G Qian, CR Rao, X Sun, Y Wu

Proceedings of the National Academy of Sciences of the United States of America | Published : 2016

Abstract

Current algorithms for association rule mining from transaction data are mostly deterministic and enumerative. They can be computationally intractable even for mining a dataset containing just a few hundred transaction items, if no action is taken to constrain the search space. In this paper, we develop a Gibbs-sampling-induced stochastic search procedure to randomly sample association rules from the itemset space, and perform rule mining from the reduced transaction dataset generated by the sample. Also a general rule importance measure is proposed to direct the stochastic search so that, as a result of the randomly generated association rules constituting an ergodic Markov chain, the overa..

View full abstract

University of Melbourne Researchers